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We present the first calculation of the Bayesian evidence for different prototypical single field 
inflationary scenarios, including representative classes of small field and large field models. This 
approach allows us to compare inflationary models in a well-defined statistical way and to determine 
the current "best model of inflation". The calculation is performed numerically by interfacing the 
inflationary code Fieldlnf with MultiNest. We find that small field models are currently preferred, 
while large field models having a self-interacting potential of power p > 4 are strongly disfavoured. 
The class of small field models as a whole has posterior odds of approximately 3 : 1 when compared 
with the large field class. The methodology and results presented in this article are an additional step 
toward the construction of a full numerical pipeline to constrain the physics of the early Universe 
with astrophysical observations. More accurate data (such as the Planck data) and the techniques 
introduced here should allow us to identify conclusively the best inflationary model. 

PACS numbers: 98.80.Cq, 98.70.Vc 



I. INTRODUCTION 

The theory of inflation represents a cornerstone of the 
standard model of modern cosmology EH1 (for a review, 
see e.g. Refs. |5|-l9|). By definition, it is a phase of accel- 
erated expansion which is supposed to take place in the 
very early universe, somewhere between the electroweak 
to the Grand Unified Theory energy scales, i.e. between 
- 10 3 GeV and - 10 15 GeV [13]. Inflation allows us 
to understand several puzzles which plagued the pre- 
inflationary standard model and that could not be under- 
stood otherwise. Without inflation, the standard model 
of cosmology would remain incomplete and highly unsat- 
isfactory. The most spectacular achievement of inflation 
is that, combined with quantum mechanics, it provides 
a convincing mechanism for the origin of the cosmologi- 
cal fluctuations (the seeds of galaxies and of Cosmic Mi- 
crowave Background - CMB - anisotropies) [TT| - il5j and 
it predicts that their spectrum should be almost scale in- 
variant (i.e. equal power on all spatial scales) f^. [HI Il7j 
which is fully consistent with the observations [18| . This 
part of the scenario is particularly remarkable since it 
combines general relativity and quantum mechanics. 

However, the physical nature of the inflaton (the field 
driving inflation) and its relation with the standard 
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model of particle physics and its extensions remain elu- 
sive. Moreover the shape of its potential is not known 
except, of course, that it must be sufficiently flat. This 
is not so surprising since, as mentioned above, the infla- 
tionary mechanism is supposed to take place at energy 
scales larger than typically ~ ITeV, in a regime where 
particle physics is not known and has not been tested 
at accelerators. Another crucial aspect of the inflation- 
ary scenario is how it ends and how it is connected to 
the subsequent hot big bang phase. It is believed that, 
after the slow-roll period, the field reaches the bottom 
of its potential, oscillates and decays into radiation [l9r - 
[22l ]. In this way, inflation is smoothly connected to the 
radiation-dominated epoch. However, the energy density 
at which the radiation-dominated era starts is not ac- 
curately known, although some new constraints on the 
reheating have recently been obtained in Refs. [23rl25| . 

Despite the fact that it has become a cornerstone of 
modern cosmology, inflation is not as observationally con- 
strained as the other components of the standard model. 
To improve on this situation, full numerical approaches 
can be put in place in order to use, in an optimal way, 
the astrophysical data now at our disposal [26-32] . This 
should allow investigations on the "fine structure" of the 
inflationary scenario. This program is particularly timely 
since new high-accuracy astrophysical observations, such 
as the European Space Agency Planck data (33[, among 
others, will be released soon. They will provide an un- 
precedented window of opportunity to learn about infla- 
tion. 
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In this article, we are concerned with the question of 
how to evaluate the performance of a given inflation- 
ary model to explain the data as compared with oth- 
ers. This problem can be dealt within Bayesian infer- 
ence [34| (see e.g. Ref. [35| for an application to infla- 
tionary model comparison). In fact, Bayesian statistics 
can be used at two levels. The first level is to determine 
which model parameter values are favoured by the data 
within a given inflationary model, and this for all models. 
To this end, one needs to compute the model's predic- 
tions for the relevant observables, such as the CMB, the 
galaxy power spectra, etc., and then use the experimen- 
tal data to extract the posterior probability distributions 
of the model parameters given the data and the theoret- 
ical priors. The second level is to use Bayesian inference 
for model comparison. At this level, one has to calcu- 
late, for each model, the global likelihood (also known 
as the evidence, or model likelihood) which is obtained 
by integrating the usual likelihood over all of the model 
parameters' values, weighted by their prior probability 
distribution. The resulting quantity can be used to com- 
pute the posterior probability of the model, given the 
available data, thus updating our prior belief in each of 
the inflationary models in light of the observations. The 
Bayesian approach to model comparison has the advan- 
tage of automatically incorporating a quantitative notion 
of "Occam's razor", i.e. more complex inflationary mod- 
els are assigned a larger posterior probability only if their 
complexity is effectively required to explain the data. 

On the practical side, these two levels in Bayesian in- 
ference can be implemented by adopting appropriate nu- 
merical algorithms to integrate the power spectrum for a 
given inflationary model. This has been routinely avail- 
able for several years now and, in this paper, we use the 
public code Fieldlnf [l8|, H|| H3 ■ This inflationary code 
is then coupled with a CMB perturbation code, such as 
CAMB [38] , and then linked with an appropriate algorithm 
capable of delivering both the posterior distributions for 
each model's parameters as well as the Bayesian evidence 
of each model. The evidence is computed using the pub- 
licly available MultiNest code |39l - t4l| . which implements 
the nested sampling alg orithm, employed as an add-on 
sampler to CosmoMC [42j |. 

On the theoretical side, one has to choose classes of 
scenarios that are representative of the inflationary land- 
scape and that one wishes to analyze. In this article, we 
focus on large and small field models for reasons speci- 
fied in the following. The reheating stage is described via 
the reheating parameter as introduced in Refs. (l8l. [23|. 
Moreover, since the choice of priors is always relevant in 
problems of model comparison, we have paid particular 
attention to their physical motivation and we carefully in- 
vestigate this question both for the parameters describing 
the inflationary potential and for the reheating. 

This article is organized as follows. In the next section, 
Sec.|lTl we present the models studied, paying special at- 
tention to the reheating part and the so-called reheating 
parameter. In Sec. IIII1 we recall the definition of the 



Bayesian evidence, describing in detail how the priors 
on the free parameters characterizing each scenario are 
chosen. We also explain how its calculation is imple- 
mented numerically. Finally, in Sec. IIV1 we present our 
results and discuss their physical implications. Readers 
already familiar with the inflationary models, techniques 
and methods can directly jump to Sec. lfflBl Perhaps the 
most important outcome of our article is that it sketches 
a general method which allows us to quantify and de- 
termine the "best" model of inflation (within the list of 
models considered here). 

II. INFLATIONARY COSMOLOGICAL 
PERTURBATIONS 

In this section, after having briefly recalled how 
the theory of cosmological perturbations of quantum- 
mechanical origin allows us to derive the inflationary pre- 
dictions, we present the scenarios studied here, discuss 
our choice of parametrization and motivate it based on 
physical considerations. 

A. Choosing the Inflationary Potential 

In order to compare inflation with various astrophysi- 
cal observables, one must first determine the power spec- 
trum of the density perturbations defined by the follow- 
ing expression: 

r c (k)^^\c k \ 2 (l) 

where ^ is the comoving curvature perturbation in 
Fourier space and is a conserved quantity on super- 
Hubble length scales 041,111. 

This power spectrum depends on the shape of the 
inflaton's potential, and thus, on its free parameters 
which have to be specified. It is common to describe 
the landscape of possible single field inflationary models 
with three different archetypal classes: large field mod- 
els, small field models and hybrid inflation. This simple 
approach is based on the following considerations. Any 
inflaton potential V(cj)) can always be Taylor expanded 
as 

V^=V ± a (J-j + .... (2) 

According to the value of the coefficients of the expan- 
sion, one obtains different classes of models. If the con- 
stant term Vq vanishes, then one obtains a large field 
model [J, |44| . Instead of restricting ourselves to a mas- 
sive scenario, a simple generalization is to consider an ar- 
bitrary power index p, not necessarily fixed to p = 2 [45| . 
If the constant term is not zero, then one obtains a small 
field model @, H| (with a negative second term) or an 
effective hybrid model 0, [47J (with a positive second 
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term). Again, instead of considering only a quadratic 
term, it is more generic to let the power index unspeci- 
fied. This leads to the three classes mentioned before. 

An important question is whether the other terms of 
the Taylor expansion are under control. This has led to 
a debate on the question of whether vacuum expectation 
values of </> larger than the Planck mass are meaningful 
or not [j| [48|, |4f|. In the simple approach used here, 
we do not take part in this discussion and consider sub- 
as well as super-Planckian vacuum expectation values. 
Moreover, hybrid inflation is an intrinsic multiple field 
scenario (with the above potential, inflation could not ac- 
tually stop) which cannot always be described by a single 
field approach [H^IEU- Indeed, in a multiple field model, 
the presence of entropy perturbations can cause the evo- 
lution of Cfc on large scales and this effect can modify the 
power spectrum during the pre-heating stage. Since this 
type of effect is model-dependent, it must be studied for 
each scenario and, for this reason, it is wiser, in a first 
step, to focus on simpler models. For this reason, we will 
consider in the following only the large and small field 
scenarios having, respectively, the following potentials: 



and 



V(4>) = M 



V[(t>) = M 4 



4 I 



pi 



P 



(large field), 



(small field). 



(3) 



(4) 



Of course, this has to be considered as a first step towards 
a more complete scan of the inflationary landscape. The 
large field model is characterized by two parameters, the 
energy scale M and the power index p. The small field 
potential is characterized by three parameters, M, /i and 
p. We come back to the issue of the prior distributions 
to assign to each parameter in section UlI B[ 



B. Describing the Reheating 

In order to compare an inflationary model with obser- 
vations, we also need to take into account the reheating 
stage which takes place after the end of inflation and be- 
fore the onset of the radiation-dominated era. This is 
compulsory since one needs to know the actual value of a 
physical wavenumbers during inflation from its observed 
value today. For instance, the amplitude of the power 
spectrum P* is measured at a given wavenumber, typi- 
cally fc*/a = 0.05Mpc _1 , where ao denotes the present- 
day scale factor. During inflation, the corresponding 
physical wavenumber is stretched back to 



— = — (1 + z nd)e °- 
a ao 



-N 



(5) 



where z on( j is the redshift at which inflation ended, -/V en( j 
the total number of e- folds during inflation and N = In a 



the number of e-folds at the time considered during infla- 
tion. The quantity fc*/a is uncertain precisely due to the 
existence of the reheating. Assuming instantaneous tran- 
sitions between inflation, reheating, radiation and matter 
era, one can simplify 



1 + Zend = (1 + Z, 
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where "reh" and "eq" respectively stands for the end of 
reheating and the equality between the energy density of 
radiation and matter. The so-called reheating parame- 
ter i? rad [l8|, [23[ describes the evolution of the Universe 
during the reheating stage and is defined by 
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rad 



^end / Pcnd 



Oroh 

such that Eq. © becomes 
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(7) 
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where p ro is the energy density of radiation today 1 . As a 
result, i? ra d encodes all of our ignorance on how the re- 
heating influences the observable inflationary power spec- 
tra. In fact, it is for inflation what the optical depth r 
is for CMB observations. The latter encodes how much 
reionisation of the universe affects the measured CMB 
anisotropies (independently of the details of the reioni- 
sation history, at least at first order) while i? ra d plays a 
similar role for the reheating. As it should be clear from 
Eq. ([7|), i? ra( j quantifies the deviation from a reheating 
era which would be radiation-like. 

In fact, as discussed in Ref. [23j, Eq. ([7]) can be re- 
cast into various equivalent forms. In terms of the num- 



ber of e-folds during reheating AN 
ln(a re h/a e nd), one has 
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(9) 



where u> ro h stands for the mean equation of state param- 
eter 

1 f N ** P(n) 



Wreh 



AiV 



p{n) 



■ dn. 



(10) 



Here P(n) and p(n) are the instantaneous total pres- 
sure and energy density of the universe during reheating. 
This description is completely general since no assump- 
tion about the physical properties of the effective fluid 
dominating the matter content of the universe during re- 
heating has been made. One can also express AA^ in 
terms of u> re h such that 



lni? r 



;ad 



1 - 3w rch 
12(l + w rc h) 
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Preh 
Pend 



The density parameter of radiation today is n ro 
10- 5 h 2 . 



(11) 
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As expected, one can verify explicitly that i? ra d = 1 if 
w reh = 1/3. 



III. BAYESIAN MODEL COMPARISON 

In this section, we briefly review Bayesian model com- 
parison, which we adopt to compare the performance of 
our inflationary models (for further details, see e.g. |yj]). 
As a preliminary remark, we notice that if one seeks to 
determine the most economical description of the infla- 
tionary potential in light of the available data, Bayesian 
model comparison is well suited, in that classical statis- 
tics only allows to reject hypotheses, not to confirm them 
(see also Ref. 52[ for alternative model selection crite- 
ria). Therefore, while some simpler models might be- 
come ruled out in a classical sense (i.e. their parameter 
space can become completely constrained by the data, 
until no viable region remains), classical statistics does 
not allow one to rank the remaining models in any way. 
Bayesian model comparison, with its natural inclusion of 
the Occam's razor effect, is therefore the only available 
tool to quantify in a self-consistent way our preference 
for a specific model. 



A. The Bayesian evidence 

Bayesian model comparison aims at computing the 
posterior probability of a model in view of the available 
data. The fundamental idea behind the procedure is that 
"economic" models that fit well the data are rewarded 
for their predictivity, while models with a large number 
of free parameters that turn out not to be required by 
the data are penalized for the wasted parameter space. 
Therefore, in a Bayesian sense, the "best" model is the 
one that achieves the best compromise between quality 
of fit and simplicity. One of the attractive features of 
Bayesian model comparison is that it automatically em- 
bodies a quantitative version of Occam's razor, i.e., the 
principle of simplicity. 

Here and in the following, by "model" we denote a 
choice of inflationary potential, together with a specifi- 
cation of its free parameters, <dj, and of their prior prob- 
ability distribution, p(Qj \Mj). The specification of the 
prior is fundamental for model comparison, as the prior 
shape and range influence the Occam's razor effect. From 
Bayes' theorem, the posterior probability of model Mj 
given the data d, p(Mj\d), is related to the Bayesian 
evidence (or model likelihood) p(d\Mj) by 



p(Mj\d) = 



p(d) 



(12) 



where p(Mj ) is the prior belief in model M j . In Eq. (TT^I) , 
P(d) = ^2iP(d\Mi)p(Aii) is a normalization constant 
(where the sum runs over all available known models Mi, 



| In Soil 


Odds 


Strength of evidence 


< 1.0 


<3 : 1 


Inconclusive 


1.0 


~ 3 : 1 


Weak evidence 


2.5 


~ 12 : 1 


Moderate evidence 


5.0 


~ 150 : 1 


Strong evidence 



TABLE I: Empirical scale for evaluating the strength of evi- 
dence when comparing two models, Mo versus Mi (so-called 
"Jeffreys' scale" , here slightly modified following the prescrip- 
tions given in [34l . l53l]). The right-most column gives our 
convention for denoting the different levels of evidence above 
these thresholds. 
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is the Bayesian evidence, where p(d\Qj, Mj) is the like- 
lihood. The Bayesian evidence is thus the average like- 
lihood under the prior, and is the central quantity for 
Bayesian model comparison. 

Given two competing models, Mo and Mi, the poste- 
rior odds among them are given by 



p(M \d) 
p{M x \d) 



01 



p(Mq) 



(14) 



where we have introduced the factor Bqi as defined as 
the ratio of the models' evidences 



p(d\M ) 
p(d\Mi) 



(15) 



The Bayes factor thus updates our relative state of belief 
in two models from the prior odds to the posterior odds. 
Large values of B i denote a preference for Mo, and 
small values of Bqi denote a preference for M\. The 
"Jeffreys' scale" (Table fl} gives an empirical prescription 
for translating the values of B i into strengths of belief. 

Given two or more models, specified in terms of 
their parametrization and priors on the parameters, it 
is straightforward (although sometimes computationally 
challenging) to compute the Bayes factor. Depending on 
the problem at hand, semi-analytical [Hil . [55| and nu- 
merical [H, HO, |56T - |58| techniques are available. In the 
usual case where the prior over models is taken to be 
non-committal (i.e. p(Mj) — 1/N), the model with the 
largest Bayes factor ought to be preferred. Thus the com- 
putation of Boi allows to select one (or a few) promising 
model(s) from a set of known models. This framework 
has recently been extended to evaluate the probab ility 
that the set of known models is incomplete, see Ref. [59j ■ 

Finally, we can also summarize our findings in terms of 
posterior probability for the entire class of models being 
considered here, large field or small field. From Bayes' 
theorem, the posterior probability for e.g. the small field 
class (SF) is given by 



^ P (d|SF t )p(SF,) 



(16) 
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where 



p(d) = J2p(d\SF l )p(SF i ) + J>(d|LF J )p(LF J ) (17) 



and nsF = 3 is the number of small field models con- 
sidered in the class, while tt-lf = 6 is the number of 
large field models, as explained in the next section. Re- 
garding the choice of priors for the models, in view of 
comparing the viability of large field and small field in- 
flation, it is natural to divide equally the prior proba- 
bility between the two classes, and then further subdi- 
vide it equally among the models in each class, so that 
p(SFj) = l/(2n SF ) and p(LFj) = l/(2n LF ). For reasons 
that shall become clear below, it will be convenient to 
consider the Bayes factor between the various models and 
the large field model with p = 2 (LF 2 ), and it is therefore 
useful to divide both the numerator and the denominator 
of Eq. (fTBjl by the evidence of LF 2 , obtaining: 



p(SF|d) = --n^ 



£r B^piSFt) + E" LF Bj*p(LFj) (18) 



SF 



(-Bi*)sF + \Bi*)l,F 

-1 



1 



(Bit) 



LF 



\Bi*)SF 
where we have defined 



(B 



i*/SF 



-, «SF 

EBi*, 



"-SF 



= 1 



TlT T71 L ' 



"-LF 



(19) 
(20) 

(21) 
(22) 



and in the above £?» denotes the Bayes factor between 
model i and the LF 2 model. 

It is also instructive to consider the Bayesian complex- 
ity associated with each model, defined as [6(3] 



V KL (P, tt) - P KL 



(23) 



where, here, tt denotes the prior distribution and 
2?kl (P, tt) is the Kullback-Leiber divergence between the 
posterior P and the prior, tt, namely 



Vkl (P, n) 



p(6\d) log 



p(0\d) 
n(6) 



66. 



(24) 



In Eq. (f2"3"]) , D K l denotes a point estimate for the KL di- 
vergence. It has been shown in [34], HH that the Bayesian 
complexity measures the number of model parameters 
that the data can constrain. Evaluated together with 
the evidence, the complexity helps to assess whether the 
parametrization of a model is excessive for the constrain- 
ing power of the available data (for details, see 61]). The 
complexity can be expressed as 



V 



KL 



(x 2 



X 



(25) 



where x 2 



-21n£ and the expectation value is taken 



with respect to the posterior. The second term, % is a 
plug-in estimate that can be taken to be for example the 
best-fit x 2 value or the value of the x 2 at the posterior 
mean. Here we adopt the best-fit value, following 61]. 

As mentioned above, the evidence is computed using 
the publicly available MultiNest code [39l - [4l1 ]. which im- 
plements the nested sampling algorithm. The gist of 
nested sampling is that the multi-dimensional evidence 
integral of Eq. (fl"3")) is recast into a one-dimensional inte- 
gral. This is accomplished by defining the prior volume 
x as dx = p(<dj\Mj)dQj so that 



x(X) 



(26) 



£(e 3 -)>A 



where the integral is over the parameter space enclosed 
by the iso-likelihood contour C(Qj) = A. So x(X) gives 
the volume of parameter space above a certain level A 
of the likelihood (for a specific model Aij). Then the 
Bayesian evidence, Eq. (|13p . can be written as 



P (d\Mj 



C(x)dx, 



(27) 



where C(x) is the inverse of Eq. (|2"6"|) . Samples from 
C{x) can be obtained by drawing uniformly samples 
from the likelihood volume within the iso-contour sur- 
face denned by A. The standard deviation on the value 
of the log evidence can be estimated as (-ff/nnvc) 1 / 2 , 
where H is the negative relative entropy and nii vo is 
the number of live points adopted, which in our case is 
«iivo = 1000 (see Ref. [39[ for details). We have checked 
that our evidence values are robust (within error bars) if 
one increases nii vo to 5000. The posterior distributions 
have also been cross-checked with standard Metropolis- 
Hastings Markov-Chain-Monte-Carlo (MCMC). 



B. Choice of Priors 

Since our aim is to evaluate the evidence of large and 
small field models, it is absolutely crucial to choose well- 
motivated priors for the parameters describing the po- 
tential. In order to see why it is so, it is instructive 
to consider the evidence of a simple, one-parameter toy 
case, where there is only one single parameter 8, whose 
prior density under model M. is given by p(6\M). Wc 
shall further assume that the likelihood is much more 
sharply peaked than the prior {i.e. the quantity 9 has 
been well measured), so that p(ff) s» const, in the range 
80 where the likelihood C{&) is appreciably different from 
zero. Then the evidence of model A4, Eq. (ITS")) , is approx- 
imately equal to 



p(d\M) *£(8ml)69p(0ml\M), 



(28) 



where 9ml is the value that maximizes the likeli- 
hood function. Since the prior must be normalized, 
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Parameter 


Small field models, Eq. © 
SFI S SFI; SFIy 


Large field models, Eq. (|3|l 
LFIp LFI 2/3 LFIi LFI 2 LFI 3 LFI 4 


Normalization, InP* 
Exponent, p 

Vacuum expectation, log(/i/Mpi) 
Reheating, lni? 


[2.7 x 10" 10 ,4.0 x 10" 10 ] 
[2.4, 10] 
[-1,0] [0,2] [-1,2] 
[-46, 15] 


[2.7 x 10" 10 ,4.0 x 10" 10 ] 
[0.2,5] 2/31234 
Not applicable 
[-46, 15] 


n 


4 4 4 


3 2 2 2 2 2 



TABLE II: Inflationary models considered in this analysis and priors on their parameters. All priors are taken to be uniform 
(i.e. flat) in the variable and range specified, see the text for a detailed justification. In the last row, n is the number of free 
parameters related to the inflationary sector. 



p(9ml\M) ~ where £ is the characteristic width 

of the prior. Therefore one finds that p(d\M.) oc X -1 , 
i.e. the evidence scales inversely proportionally to the 
width of the prior. The term 66 /£ is the so-called 
"Occam's factor" , which penalizes models with a large 
"wasted" parameter space under the prior, i.e. models for 
which the characteristic width of the likelihood is much 
smaller than that of the prior, 69 /£ <C 1. Hence the a 
priori plausible range of parameter values determines the 
strength of the Occam's penalty term, and for this rea- 
son it has to be carefully chosen on the basis of physical 
considerations 2 . 

Going back to the potentials ((3]) and (j4]), we notice 
that the parameter M, common to both classes of mod- 
els, is a priori unknown, and is observationally deter- 
mined by the overall normalization of the power spec- 
trum, P*. Since the scale of M is unknown, it is ap- 
propriate to adopt a prior flat on InM, to reflect the 
fact that we are giving equal a priori probability to all 
orders of magnitude within some suitably chosen lower 
and upper limits. A flat prior on InM is equivalent to a 
flat prior on In P* , and therefore in our numerical sam- 
pling we swap InM for InP* as a fundamental param- 
eter. Since the overall power spectrum normalization is 
common to all models, the precise range of values un- 
der the prior for In P* becomes irrelevant (as long as the 
range is sufficiently wide to encompass the support of the 
likelihood), as all models share the same Occam's razor 
penalty from this common parameter. In practice, we 
chose InP* <E [2.7 x l(r 10 ,4.0 x lCT 10 ], but because of 
the above argument the Bayes factor between our models 
would remain unchanged even if this range was arbitrar- 
ily enlarged. 

For large field models, we chose to adopt a flat prior 
in the range 0.2 < p < 5. The lower limit is arbitrarily 
chosen to encompass all propos ed large field potentials 
having a fractional power [62, 63]. In principle, one could 
imagine an arbitrarily small p (which would suggest the 



2 Notice that parameters which are unconstrained by the data are 
not penalized by the Occam's factor, i.e. if the likelihood's width 
is similar to the prior range, then 86 /£ ~ 1 and the Occam's 
factor effect vanishes. 



use of a Jeffreys' prior, instead) but, up to now, there is 
no theoretical motivation to do so. On the other hand, 
there is no strong theoretical reason not to consider a 
model with, say, p — 7. However, we know that the data 
already strongly disfavour models with p > 5 (as a matter 
of fact, even models with p > 3 are disfavoured [23j]) and 
therefore one expects that the evidence of models with 
p > 5 (fixed) would be strongly disfavoured. Further- 
more, if one wanted to enlarge the prior range to p > 5 it 
would be easy to rescale the evidence to account for the 
enlarged parameter space, since the likelihood is close to 
for p > 5. This would lead to a larger Occam's penalty 
and thus to a lower evidence, see Eq. (|28|) . 

For small field models, we have chosen a flat prior 
p 6 [2.4, 10] as our representative class since p = 2 is a 
very special case. As discussed in Ref. [18j ]. approaching 
the value p = 2 is numerically tricky and we have cho- 
sen the lower bound as the closest, but different, possible 
value of p > 2. Models with p < 2 (gH, |65| might in prin- 
ciple be included but would constitute another class of 
models since this would require to cross the p = 2 barrier. 
Moreover, models with negative p correspond to very dif- 
ferent physical regimes. For instance, the model with 
p = —4 is nothing but the Coulomb potential of brane in- 
flation and was analyzed in detail in (6(| . For the reasons 
detailled in Sec.|H and at this stage of the analysis, we do 
not include those cases. The upper bound for p has been 
chosen typically an order of magnitude higher. Theoreti- 
cally, as already mentioned above, small field models are 
archetypal of inflationary potentials which can be Taylor 
expanded in the (small) field values, in units of a given 
vacuum expectation value fi. As a result, too large val- 
ues of p would appear quite unnatural. Concerning /i, its 
scale is a priori unknown and, therefore, we have chosen 
a flat prior on log(/i/Mpi) in the range [—1,2]. On one 
hand, if one has a theoretical prejudice of viewing the 
small field models as representative of Taylor expanded 
potential (as was done above), and in particular in the 
supersymmetric framework, one would expect /i < Mpi 
to keep the supergravity corrections under control. On 
the other hand, other theoretical approaches do not for- 
bid super-Planckian vacuum expectation values (67j since 
one can always consider that this potential is obtained, 
not from a Taylor expansion but exactly from a more fun- 
damental theory. The corrections would therefore not be 
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controlled by the ratio of the vacuum expectation value 
to the Planck mass but by the ratio of the energy density 
to the Planck density. Hence our prior range is chosen 
in such a way as to extend above the Planck mass. Con- 
cerning the boundary values, in the limit ji/Mp\ 3> 1, 
one can show that the two first slow-roll parameters, and 
hence all observable predictions, do not longer depend 
on both fi and p. As a result, it is straightforward to 
show from Eq. (|28|) that the corresponding Bayes factors 
would be unchanged for larger values of p. In the limit 
fi/Mpi <C 1, the first slow-roll parameter becomes tiny 
and the second one becomes /i-independent such that, 
again, the observable predictions, and thus the likelihood 
and the evidence, are no longer sensitive to p. 

Finally, in addition to the two broad classes of large 
field and small field models, we have introduced in our 
model space finer subdivisions leading to more specific 
model classes. Motivated by the above prior discus- 
sion, it is natural to further distinguish between small 
field models allowing super-Planckian expectation values 
[i.e. with log(/x/Mpi) > 0] from the ones that do not 
[log(/i/Mpi) < 0]. In the large field class, we have also 
singled out some models having a peculiar interest such 
as the genuine chaotic massive inflation model (p = 2), 
monodromy inflation {p — 2/3), linear inflation (p = 1), 
and the self- interacting potential (p = 3 or p — 4). Of 
course, one must restrict oneself to the positive part of 
the potential when necessary. Therefore we consider a 
total of 9 classes of models. 

Having parametrized the evolution of the universe dur- 
ing the reheating in the previous section, one must now 
discuss the choice of t he p rior on the reheating parame- 
ter. As shown in Ref. [18j . instead of working with i? ra d 
introduced in Eq. (7) . it is more convenient to work with 
the rescaled reheating parameter i? defined by 

1/4 

R = ^ad^f 1 • (29) 

Mpi 

As we recap in the appendix,, i? ra d exhibits trivial corre- 
lations with the normalisation of the power spectrum P» 
which can be easily removed by considering i? instead. 
Notice that once the inflationary model is specified, p en d 
is known and Eq. (|29|) is nothing but a rescaling. Clearly, 
the order of magnitude of the different physical quanti- 
ties appearing in Eqs. ([7]) and (|29p is unknown and this 
suggests that we choose a flat prior on lni?. The next 
step is to determine the prior boundaries. In fact, using 
the expression of i? ra d given before, one also has 

~ 12(1 + w lch ) 11 V Mpi / 6(1 + w rch ) n V Mm ) ' 

(30) 

Positivity energy conditions in General Relativity im- 
poses that u? rc h cannot exceed unity and we want to sep- 
arate inflation from reheating such that W re h cannot be 
less than —1/3. Moreover, p nuc < p IC \ L < p cn d, where 
Pnuc is the energy density at Big-Bang Nucleosynthesis 



(BBN), which we take to be plLt = lOMeV, this im- 
plies that -46 < lni? < 15 + (1/3) In (p end /M^). Since 
there is no preferred value for In R, we initially take the 
maximal possible theoretically allowed range [—46,15]. 
However, for each given model parameter values, we then 
reject all In R values not satisfying the consistency bound 
lni? < 15 + (1/3) In (pcnd/Af^). Finally, notice that this 
description of reheating via the In i? parameter and its 
prior range is common to all models. 

To conclude the discussion on priors, we have cho- 
sen flat priors on the standard cosmological parameters 
centered around their currently measured values, i. e. for 
the density parameter of baryons tthh 2 , of dark matter 
Jl,j m /i 2 , the angular size of the sound horizon at last scat- 
tering 6 and the optical depth r. We also marginalize 
over the amplitude of the unresolved SZ signal with a 
flat prior in the range A$z € [0,2]. These prior choices 
do not impact on our evidence result for the inflationary 
models as all models share the same standard cosmologi- 
cal parameters and their respective priors. We moreover 
assume throughout a flat universe as predicted by cosmic 
inflation. 

The models we consider and the priors on the relevant 
inflationary parameters are summarized in Table [II] 



IV. RESULTS AND DISCUSSION 

In this section, we present our model comparison re- 
sults for the classes of models described above. Concern- 
ing the data, we have used the seven years Wilkinson 
Microwave Anisotropies Probe (WMAP7) data [681470] 
complemented with the Hubble Space Telescope (HST) 
constraints on the Hubble constant today, iio = 74.2±3.6 
km/s/Mpc [Til ] . Our findings are summarized in Fig. [TJ 
where we show the Bayes factors for each model, com- 
puted with respect to the large field model with p = 2. 

Within the class of large field models, we can see that 
models with p > 3 are disfavoured, at the "weak evi- 
dence level" for p — 3 and at the "strong evidence" level 
for p — 4. Clearly, one can conclude that models with 
even larger (and fixed) values of p would be even more 
strongly disfavoured, so that they can be effectively ruled 
out. We have chosen the large field p — 2 model as our 
"reference model" (the one with respect to which the 
Bayes factors are computed) because it plays the role 
of a watershed point: large field models with shallower 
potentials are preferred by the Bayesian evidence, with 
p = 1 and p = 2/3 gathering slightly more than "weak 
evidence" in their favour. However, the evidence is not 
strong enough to allow one to conclude a definite prefer- 
ence for these models. The more generic large field model 
with p 6 [0.2,5] is also weakly preferred over LF2, and 
this despite the extra parameter of the former, which in- 
curs an Occam's razor penalty. As expected, the perfor- 
mance of this model, as measured by the evidence, falls 
in between the steep potentials (p > 2) and the shallower 
ones (p < 2). 
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SFI, (M>Mp,) 
SFI, O^Mp,) 
SFI, 

LFI p (0.2<p<5) 
LFI 4 (p=4) 
LFI 8 (p=3) 
LFI S (p=2) 
LFI, (p=l) 
LFI 8/8 (p=2/3) 



1.1 6 ±0.26 
2.29 ±0.25 
2.38 ±0.1 8 
1.20 ±0.26 
-5.45±0.26 
-2.13±0.26 
0.00 

1.45 ±0.26 
1.75 ±0.26 
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FIG. 1: Results for the Bayes factor between different inflationary models considered in the analysis. The names of the models 
are specified on the left of the figure. The Bayes factor are computed taking massive large field model as the reference model, 
and the results are given in the column on the right of the plot. The dotted vertical lines indicate the thresholds of weak, 
moderate and strong evidence, as per Table [I] 



Moving on to small field models, we remark that their 
overall performance is superior to our reference large field 
model LF2, but quite comparable to the shallower large 
field models, despite the fact that small field models have 
one or even two parameters more than large field mod- 
els. The very best models of inflation are small field. 
Within the error bars, the evidence cannot distinguish 
between a model with an upper cutoff at Mpi and one 
that allows /1 to go above the Planck mass. Models with 
purely super-Planckian expectation values are only very 
slightly disfavoured, by about 1 unit in the log evidence. 
Therefore we can conclude that the data are presently 
not sufficient to distinguish between the two scenarios. 

Further insight in the model comparison outcome can 
be garnered by investigating simultaneously the Bayesian 
complexity and the evidence (or the Bayes factor) of 
the models considered here (see Ref. 61] for further de- 
tails about the interpretation of the complexity). The 
Bayesian complexity, Eq. (1251) . has been computed for 
each model from a pure MCMC run whose convergence 
has been monitored by using the R statistics implemented 



in CosmoMC [42]. The chains have been stopped as soon as 
the estimated errors were below 3%, which corresponds 
to a total number of samples ranging from 5 x 10 to 
4 x 10 5 depending on the underlying inflationary model. 
The variance of our complexity estimate is obtained from 
the variance of four sub-chains of equal length randomly 
selected from the post burn-in samples. Both quantities 
are displayed in Fig. [51 where the horizontal axis gives the 
value of the number of input parameters for each model 
(both inflationary and cosmological) minus the Bayesian 
complexity, which we denote by the symbol ACb. A value 
of ACb close to zero means that the model parameters are 
well constrained by the data, while ACb > gives an es- 
timate of the effective number of parameters remaining 
unconstrained by the data. 

The value of ACb for the large field models with p > 2 
is generally smaller, and reaches ACb ~ for p = 4, 
the model with the lowest evidence. This is a conse- 
quence of the tension between these models and the data, 
which leads to the reheating parameter becoming more 
and more constrained as p increases: for p — 4, we find 
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a 2(7 lower limit lni? > —2.1, thus leading to an increase 
in the value of the complexity by about 1 unit. Since the 
models with p = 3 and p = 4 have the smallest Bayes 
factor while exhibiting values of ACb close to (meaning 
that all of their free parameters are well constrained) , we 
can conclude that those models are genuinely disfavoured 
by the data. On the other hand, for the models having a 
similar Bayes factor, Fig.[2]shows that the larger number 
of free parameters in the small field models corresponds 
to an increase in the number of unconstrained parameters 
ACb with respect to its value for the large field models 
with p < 1. This indicates that the extra inflationary pa- 
rameters in the small field class are not being constrained 
by the data. Therefore we are led to conclude that while a 
slight preference for small field models is beginning to ac- 
cumulate, it is too early to be able to conclusively favour 
small field models over large field ones. It is expected 
that Planck data will be able to conclusively pass judge- 
ment on this issue. 

A consistent picture emerges when one considers the 
Bayesian complexity of the two models with the largest 
number of parameters in each class, namely SFI/ and 
LFIp, with 4 and 3 inflationary parameters, respectively. 
For both cases, we find a similar complexity, Cb — 5.9, 
which suggests that current data can constrain up to ap- 
proximately 2 inflationary parameters. This is because 
our models have all N — 5 non-inflationary parameters in 
common, including the SZ amplitude, and 4 of them are 
well constrained and contribute approximately 4 units 
to the Bayesian complexity. This leads to the conclusion 
that WMAP7 data are still insufficiently powerful to fully 
constrain the whole inflationar y se ctor as parametrized 
in this work (see also Refs. [72l l73l|). 

We can also evaluate the posterior probability for the 
entire class of small field scenarios. From Eq. (| we 
find 



p(SF\d) ~ 0.77 ±0.03, 



(31) 



and, therefore, p(LF\d) ~ 0.23 ± 0.03. Therefore, the 
probability of the small field scenario has risen from 50% 
in the prior to 77% in the posterior. This represents 
posterior odds of ~ 3 : 1 in favour of small field infla- 
tion, as compared with large field inflation. Although, 
as explained above, this shift in the odds is by no means 
conclusive, it does represent an indication that large field 
inflation is getting increasingly under pressure from the 
data [z3- 

Finally, it is important to assess the robustness of our 
results with respect to reasonable changes in our choice of 
models' priors. Our choice to divide the prior probability 
equally between the LF class and the SF class reflects 
the desire to compare both classes of models on an equal 
footing a priori. Another natural choice for the models' 
prior would be to split the prior mass equally among 
models, i.e. to assign p(SFj) = p(LFj) = 1/(tisf + ^lf)- 
This choice would however result in prior odds of 2:1 in 
favour of the LF class, which seems contrived, given that 
it arises solely from the fact that we have double as many 



Moderate Evidence SFI, 

L l-^H SFI. <M<Mp,) 

Week Evidence 1 — 3p — 1 | j. | | ^ | 

LF], (p-1) LF1 P (0.2<p<5) SFI, (M>M n ) 

LF1, (p=S) 

Week Evidence 
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Strong Evidence 

-10 12 3 

Effective number of unconstrained parameters 



FIG. 2: Bayes factor versus the effective number of uncon- 
strained parameters (ACb) for all large and small field mod- 
els. The steeper LFI models are genuinely disfavoured by 
the data, as all of their free parameters are well constrained. 
Small field models being favoured by the evidence still have 
unconstrained parameters, and therefore it is too early to con- 
clusively rule out shallower (p < 2) large field models, despite 
the fact that they exhibit a slightly smaller Bayes factor. 




LF models as SF models. Even with this (unfair to the SF 
class) prior choice, the posterior probability for SF would 
be p(SF|d) ~ 0.6 [up from an initial prior probability 
p(SF) = 1/3], so our result of a (slight) preference for SF 
models stands. 

Finally, we notice that our result is robust with respect 
to the inclusion of further models under either the SF or 
LF class, provided such models are disfavoured by the 
data (as they would be e.g. for p > 5 in the LF class). 
Inclusion of such highly disfavoured models would result 
in their Bayes factors with respect to LF2 being close to 
0, hence the average values defined in Eqs. (f2"T1) and ([2"2")l 
would simply be rescaled by the new (larger) number of 
models in each class. However, the posterior probability 
of SF models only depends on the ratio of the average 
Bayes factors [see Eq. ([15])]. hence such rescaling fac- 
tors would largely cancel out (for a detailed discussion of 
this rearrangement of prior probability in a similar con- 
text, see Ref. [5!|). This holds true provided the overall 
number of models in each class is not widely different. 
We do not have any reason to believe that this should 
be the case. However, if one of the model classes truly 
had a much larger number of potential models in it, one 
would have to carefully reconsider the choice of giving 
both classes equal a priori mass: after all, a class of 
models with a smaller number of physically distinct pos- 
sibilities in it is a priori more predictive than a class with 
a large number of possible distinct models. 
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V. CONCLUSION 

To summarize, this article presented the first calcula- 
tion of the Bayesian evidence for different classes of in- 
flationary scenarios, explaining from first principles how 
physically meaningful priors could be derived for the fun- 
damental parameters of the models. Among the models 
studied here, small field models appear to be favoured, al- 
beit still in a fairly mild way. This result must be viewed 
as a first step towards a more exhaustive exploration of 
the inflationary landscape. With the techniques intro- 
duced here and the high accuracy CMB data soon avail- 
able, we have paved the way to the identification of the 
best inflationary scenario. 
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Appendix: Optimal reheating parameter 

As discussed in Ref . [23[ , the reheating parameter P ra d 
can also be explicitly related to observable quantities and 
to the number of e-folds N* at which a given scale fc* 
leaves the Hubble radius, i.e. when fc* = a(N*)H(N*) 
during inflation 



lnP rad ~ (A^ cnd — N*) + No — - In (8tt 2 P*) 

4 r K 



(A.l) 



where V* stands for the potential evaluated at the e-fold 
N*, i.e. when <j) = 4>(N„). The quantity r is the primor- 
dial tensor-to-scalar ratio and 



adopted recently in Ref. [23J where, in most of this work, 
the parameter lni? was used. Additional constraints 
can be further obtained if one introduces extra assump- 
tions on the equation of state parameter. This question 
was also studied in Ref. [23J, but only for specific cases 
(unlike the description of that work given in Ref. |75j ) . 
Let us notice that in Ref. (75|, the reheating phase was 
marginalised over by using a prior on AN* = N cn a — N* 
for a particular value of k* = 0.05 Mpc -1 . From the 
above formula, it is clear that one can always trade the 
parameter In R with AN* provided some values for r 
and P* are assumed, although it might seem awkward 
to introduce a scale dependent prior for a background 
quantity (the prior changes if one chooses another value 
for k* ) . A drawback of this approach is that this intro- 
duces correlations between the parametrization of reheat- 
ing and the normalization of the power spectrum when 
one has to determine the prior range. Reference [75[ 
fixes 20 < AN* < AN* nd , where AN° nd corresponds 
to the value of AN* when p cnc i = p rc h- It is easy to see 
that this choice excludes from the prior models that are 
physically legitimate. For instance, a small field model 
with p = 3, W rcn = —0.3 and fi = O.OlAfpi is such that 
17.2 < AA* < 46.0 (for this model, the lower bound is 
always smaller than 20 for /i S [0.01,10]). Of course, 
this also depends on the choice of p nuc which is not given 
very precisely. If one takes p nuc ~ (100 MeV) , the lower 
bounds become ~ 19.3 but if one chooses the extreme 
value pnuc — (lMeV) 4 , it is ~ 15.1. When constraining 
model parameters, these considerations does not really 
matter since, as shown in Ref. (23j, these model parame- 
ters are in fact disfavoured by the data. But, clearly, this 
will affect the calculation of the evidence. The above 
considerations show that priors should always be chosen 
and justified from physical considerations. 

Finally, in this article we have restricted ourselves to 
a standard post-reheating thermal history. But the ap- 
proach used here can in fact be straightforwardly gen- 
eralised to a non-standard thermal history before Big- 
Bang Nucleosynthesis (BBN). For instance, if one as- 
sumes that, inserted into the radiation-dominated era, 
there is actually a phase of evolution dominated by a 
fluid X the equation of state parameter of which is given 
by w x , one could define a new parameter P x by 



N n = In 



k*/ap 

4 Pro 4 , 



(A.2) 



Using the Friedmann-Lemaitre equations together with 
Eq. (|2"5)) . the rescaled reheating parameter now reads 



lni? ~ (N cnd -N*)+N + ± In 



(A.3) 



which clearly no longer depends explicitly on P* . There is 
thus a great advantage of sampling the model parameters 
on In R rather than In i? ra( j to prevent the unwanted de- 
generacies appearing in Eq. (|A.1[) . This was the approach 



lnP Y = 



1 — 3w„ 



12(1 



■In 



^start 



(A.4) 



where p^ tart is the energy density at the beginning of 
the epoch dominated by the fluid X and yO x nd the en- 
ergy density at the end. Then, nothing is changed in the 
above description except that the parameter P ra d should 
now be replaced with P ra dP x . Moreover, if after the 
X-dominated period, there is another, say, Y-dominatcd 
period, then one can define the parameter P Y and P ra d 
should now be replaced with P ra dP x P Y . Obviously, this 
works for an arbitrary number of new epochs. This also 
means that these non-standard thermal histories are not 
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really observable (unless one has a definite model for the fact completely degenerated with R la d- 
reheating) since the new parameters R x and R Y are in 
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